Added performance analysis as a feature with AutoModelForCausalLM by ochougul · Pull Request #888 · quic/efficient-transformers

ochougul · 2026-03-25T20:04:52Z

Summary

Added evaluate_performance(...) to QEFFAutoModelForCausalLM for end-to-end performance analysis. It chains three steps: compile, qaic-runner, and qaic-opstats.
Performance flags are always enabled at compile time: aic_perf_metrics=True and aic_perf_warning=True. For raw_device_stats, compile additionally forces stats_level=70, ddr_stats=True, and aic_pmu_recipe="KernelUtil".
Added a prefill_only argument to evaluate_performance(...); it is forwarded to compile(...).
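
The flag rules above can be sketched as a small helper. This is only an illustration, not the code in this PR: the function name perf_compile_flags is hypothetical, and treating raw_device_stats as a boolean switch is an assumption. The flag names and values are taken from the summary.

```python
from typing import Any, Dict


def perf_compile_flags(raw_device_stats: bool = False) -> Dict[str, Any]:
    """Build the compile flags described in the summary.

    Hypothetical helper: the name and the boolean form of
    raw_device_stats are assumptions made for illustration.
    """
    # These two flags are always enabled for performance analysis.
    flags: Dict[str, Any] = {
        "aic_perf_metrics": True,
        "aic_perf_warning": True,
    }
    # Raw device stats force three additional settings.
    if raw_device_stats:
        flags.update(
            stats_level=70,
            ddr_stats=True,
            aic_pmu_recipe="KernelUtil",
        )
    return flags
```

Returning a plain dict keeps the forced values in one place, so a caller could merge them over user-supplied compile options.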

Key Behavior Changes

Stage selection is now:

prefill_only=True -> prefill-only
prefill_seq_len==1 -> decode-only
otherwise -> both prefill + decode
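
A minimal sketch of the selection rules above, assuming they are checked in the order listed (so prefill_only wins when prefill_seq_len is also 1). The function name select_stages and the stage labels are hypothetical and do not come from the PR.

```python
from typing import List


def select_stages(prefill_only: bool, prefill_seq_len: int) -> List[str]:
    """Return the stages to run, following the rules listed above.

    Hypothetical helper; evaluating prefill_only first is an
    assumption based on the order of the rules.
    """
    if prefill_only:
        return ["prefill"]
    if prefill_seq_len == 1:
        # A one-token prompt leaves nothing to prefill.
        return ["decode"]
    return ["prefill", "decode"]
```

For example, select_stages(False, 128) yields both stages, while select_stages(False, 1) yields decode only.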

Artifacts and Paths

Standardized output layout: compile/, io/, performance_analysis/.
Added per-stage subdirs for prefill/decode under io, profiling, runner_outputs, and opstats.
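
One way to picture the layout is the sketch below. It is an assumption, not the PR's implementation: the description does not say where profiling, runner_outputs, and opstats live, so this example places them under performance_analysis/ and keeps io/ at the top level. The helper name stage_dirs is hypothetical.

```python
from pathlib import PurePosixPath
from typing import Dict, Iterable

# Directories that receive a per-stage subdirectory, per the PR description.
PER_STAGE_DIRS = ("io", "profiling", "runner_outputs", "opstats")


def stage_dirs(root: str, stages: Iterable[str]) -> Dict[str, Dict[str, PurePosixPath]]:
    """Map each stage to its per-stage directories.

    Assumption: everything except io/ is nested under
    performance_analysis/; the PR text does not confirm this.
    """
    base = PurePosixPath(root)
    layout: Dict[str, Dict[str, PurePosixPath]] = {}
    for stage in stages:
        layout[stage] = {}
        for name in PER_STAGE_DIRS:
            if name == "io":
                parent = base / "io"
            else:
                parent = base / "performance_analysis" / name
            layout[stage][name] = parent / stage
    return layout
```

The function only computes paths and creates nothing on disk, which makes it easy to check which artifacts a given stage selection should produce.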

Validation

Expanded tests in tests/unit_test/utils/test_auto_model_api.py; status: 70 passed.
A hardware smoke test verified both cases:
--prefill-only -> only prefill artifacts
--prompt-len 1 (without --prefill-only) -> only decode artifacts


ochougul added 3 commits March 26, 2026 00:12

added way to get performance stats automtically in qeff AutoModelForC…

a47b6e6

…ausalLM class Signed-off-by: Onkar Chougule <ochougul@qti.qualcomm.com>

removed redundancies

160b819

Signed-off-by: Onkar Chougule <ochougul@qti.qualcomm.com>

added prefill-only flag and prefill/decode run by default

aec6eae

Signed-off-by: Onkar Chougule <ochougul@qti.qualcomm.com>

ochougul requested a review from anujgupt-github March 25, 2026 20:06

ochougul self-assigned this Mar 26, 2026

ochougul wants to merge 3 commits into main from get_perf
